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The  purpose  of  this  study  was  to  determine  which  of 
four  selected  methods  v;as  most  appropriate  for  measuring 
change  such  as  gain  in  achievement.  The  four  selected 
methods  v.'ere  ravi  difference.  Lord's  true  gain,  regressed 
gain,  and  analysis  of  covariance  procedures. 

In  order  to  compare  the  four  methods  Monte  Carlo 
techniques  were  employed  to  generate  samples  of  pre  and 
post  scores  for  two  groups.  The  reliability,  variances, 
and  means  of  the  sampled  populations  were  controlled.  One 
hundred  samples  were  generated  at  each  of  20  combinations 
of  five  levels  of  reliability,  Uio   levels  of  group  size, 
and  two  levels  of  gain.  Using  each  of  the  four  methods, 
a  t  statistic  was  calculated  for  each  sample  to  test  the 
null  hypothesis  of  no  difference  in  amount  of  gain  betv/een 
the  two  groups.  The  number  of  t's  significant  at  the  0.05 
level  of  significance  was  recorded  for  each  of  the  four 
methods . 
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At  each  of  the  20  combinations  a  chi  square  test  was 
used  to  test  the  null  hypothesis  of  equal  proportions  of 
significant  t's  among  the  four  groups.   This  hypothesis 
was  rejected  in  each  case.   It  V7as  noted  that  use  of  Lord's 
true  gain  procedure  tended  to  create  a  greater  significance 
level  than  the  user  vrould  intend.   The  proportion  of 
significant  t's  for  each  of  the  other  three  methods  of 
analysis  fell  reasonably  close  to  the  expected  values.  On 
this  basis  use  of  Lord's  true  gain  procedure  vras  not 
recommended  and  since  there  v;as  no  apparent  difference  among 
the  remaining  methods  of  analysis,  none  v;as  recommended 
above  the  other. 
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CHAPTER  I 
INTRODUCTION 


Learning  has  long  been  a  primary  focus  of  investi- 
gation for  educators.  A  definition  of  learning  has  been 
offered  by  Hilgard  (I956): 

Learning  is  the  process  by  which  an  activity 
originates  or  is  changed  through  reacting  to 
an  encountered  situation,  provided  that  the 
change  in  activity  cannot  be  explained  on  the 
basis  of  native  response  tendencies,  maturation, 
or  temporary  states  of  the  organism  (e.  g.  fatigue, 
drugs,  etc  .) .   (p.  3) 

Although  Hilgard  v/ent  on  to  say  that  the  definitio2i 
is  not  perfect,  it  does  illustrate  a  commonly  accepted 
aspect  of  learning:   learning  involves  change  of  the 
behavior  of  the  organism  v/hich  learns  (Bigge,  196^,  p.  1; 
Skinrsr,  1968,  p.  10;  Combs,  1959,  p.  88). 

Educators  have  been  concerned  with  this  change  and 
often  have  sought  to  measure  the  change  occurring  in  some 
situation.  Several  methods  of  analyzing  change  have  been 
presented  in  the  literature.  A  comparison  of  these  methods 
of  analysis  of  the  measurement  of  change  and  the  accompany- 
ing difficulties  v:hich  arise  was  the  focus  of  this  study. 
The  purpose  of  the  study  vms  to  determine  v:hich  of  the 
several  selected  measures  of  change  is  most  appropriate 
under  various  conditions. 


The  Problem  of  Aeasurln/g;  Ghang;e 

In  all  sciences  measurement  is  an  approximation.   In 
conducting  a  first  order  survey,  a  surveyor  makes  three 
measurements  of  a  distance  and  takes  the  average  of  the 
three  measurements .  Physicists  and  engineers  customarily 
report  the  relative  size  of  the  error  of  their  measurements. 
Physical  scientists  have  been  fortunate  in  that  the  size 
of  the  relative  error  involved  in  their  measurements 
often  has  been  small,  frequently  less  than  0.01  and  some- 
times  less  than  10"  .  Educators  are  unfortunate  in  this 
respect  in  that  if  a  student's  true  I.  Q.  v;ere  100  and   an 
I.  Q.  score  of  38  was  observed,  the  relative  error  would 
be  0.14.   The  size  of  this  error  is  not  uncommon  and  larger 
relative  errors  do  occur.   The  Stanford-Binet  intelligence 
test  has  standard  error  of  measurement  equal  to  five  I.  Q. 
points  (Anastasi,  I96I,  p.  200).  Thus,  relative  errors  of 
0.14  or  larger  v;ill  occur  1.64  per  cent  of  the  time, 
assuming  a  normal  distribution  of  errors. 

In  measuring  change  the  proble'ii  is  compounded  since 
there  is  an  error  in  both  the  pre  score  and  the  post  score. 
Moreover,  V7hen  the  miagnitude  of  the  change  is  small  or  zero, 
the  magnitude  of  the  error  may  be  larger  than  that  of  the 
change.   This  possibility  makes  the  change  difficult  to 
detect  or  to  separate  from  the  error.  The  effect  of  this 
error  of  measurement  on  change  scores  was  noted  as  early 
as  1924  by  Thorndike: 


When  the  individuals  in  a  varying  group  are 
measured  twice  in  respect  to  any  ability  by 
an  imperfect  measure  (that  is  one  whose  self- 
correlation  is  below  1.00),  the  average 
difference  betv/een  the  two  obtained  scores 
will  equal  the  average  difference  betv;een 
the  true  scores  that  v;ould  have  been  obtained 
by  perfect  measures,  but  for  any  individual 
the  difference  betvjeen  the  tv/o  obtained  scores 
Vfill  be  affected  by  the  error.   Individuals 
who  a.re  belov;  the  mean  of  the  group  v;ill  tend 
by  the  error  to  be  less  far  below  it  in  the 
second,  and  individuals  v/ho  are  above  the 
mean  of  the  group  in  trie  first  measurement 
v;ill  tend  by  the  error  to  be  less  far  above 
it  in  the  second.  The  lower  the  self -corre- 
lation, the  greater  the  error  and  its  effect. 

Thorndike  (192^)  v/ent  on  to  shov:  that  there  vjas  a  spurious 

negative  correlation  betv/een  initial  true  score  and  ti'-ue 

gain.   He  then  stated  that  "the  equation  connecting  the 

relation  of  obtained  initial  ability  v;ith  obtained  gain, 

the  unreliability  of  the  measures  and  the  true  facts"  had 

not  been  discovered.  Lord  (195^)  developed  the  equation 

to  vrhicli  Thorndike  alluded.  Lord  made  the  following 

assumptions  concerning  the  error  of  measurement:   the 

errors 

i)  have  zero  mean  for  the  groups  tested. 
ii)  have  the  same  variances  for  both  tests, 
iii)  are  uncorrelatod  vxith  each  other  and  vjith 
true  score  on  either  test.   (Lord,  1956) 

McNemar  (1955)  has  extended  Lord's  woi-'k  to  the  case  of 

unequal  ei-ror  variances.   It  should  be  noted  here  that 

McNemar's  follovj-up  of  Lord's  v;ork  is  only  an  extension. 

When  the  variances  are  equal,  McIIemar's  formulas  are 

identical  to  Lord's  (Lord,  195S) . 


Methods  of  Analyzing;  Change 

The  estimated  true  gain  scores  derived  from  Lord's  and 
McNeraar's  equations  have  been  used  in  either  t  tests  or 
analysis  of  variance  procedures  (Soar,  I968;  Tillman,  1969). 
There  are,  in  addition  to  Lord's  method,  three  other 
commonly  used  methods  for  analyzing  change. 

One  method  has  been  the  use  of  a  straight  analysis 
of  variance  or  t  test  on  the  raw  difference  scores  as  the 
situation  warrants.   It  is  important  to  note  that  the  rav; 
scores  are  used  in  the  analysis  with  no  correction  for  the 
mirel lability  of  the  measures. 

A  second  method  is  to  complete  an  analysis  of  covari- 
ance  on  the  rav;  difference  scores  using  the  pretest  scores 
as  covariates ,   These  procedures  are  standard  statistical 
techniques  and  may  bo  fomid  in  many  texts  (Hays,  1963; 
Snedecor,  1956;  v/iner,  I962)  . 

The  third  method  of  measuring  gain  has  been  advocated 
by  Manning  and  DuBois  (I962):  ■  the  method  of  residual  gain. 
In  this  method  the  final  scores  are  regressed  on  the  initial 
scores  and  the  difference  betv;een  the  final  score  and  the 
score  predicted  by  the  regression  equation  is  taken  as  a 
measure  of  gain.   This  measure  is  than  used  in  t  tests  or 
analysis  of  variance  procedures. 

Thus  in  the  case  of  equal  variances  four  common 
methods  of  analyzing  change  have  been   identified: 

1.   Use  of  rav:  gain  scores  in  appropriate 
procedures , 


2.  Use  of  Lord's  true  gain  scores  in  appro- 
priate procedures . 

3.  Use  of  Manning  and  DuBois'  regressed  gain 
scores  in  appropriate  procedures. 

^.   Use  of  analysis  of  covariance  on  raw  gain 
scores  with  pre  scores  as  covariates . 

The  Problem 

A  researcher  faced  with  these  different  methods  and 
a  problem  in  measuring  change  is  confronted  with  a  second 
and  fimda.Tiental  methodological  problem:   which  of  the 
methods  for  measuring  change  is  most  appropriate?   It  is 
this  question  that  this  study  sought  to  ansv;er.   The 
problem  of  v/hich  method  to  use  is  further  complicated  since 
different  writers  have  claimed  that  different  techniques 
were  appropriate.   "Manning  and  DuJBois  and  Rankin  and  Tracy 
feel  that  the  method  of  residual  gain  is  more  appropriate 
for  correlational  procedures  since  it  is  metric  free"  (as 
quoted  in  Tillman,  I969,  p.  2).  Ohnmacht  (1963)  also  sug- 
gested that  this  procedure  vra.s  the  best.  Lord] in  Harris, 
1963,  chapter  2)  mentioned  regressed  gain  but  seemed  to 
advocate  his  ovin   method  as  being  superior.   This  position 
is  further  supported  by  Cronbach  and  Furby  (I97O). 

To  determine  which  of  the  methods  of  analysis  v;as 
most  appropriate,  an  empirical  study  vjas  conducted  to 
compare  the  results  of  each  method  under  knovm  situations. 


Some  Limitations 

This  study  v/as  limited  to  the  two  group  situation. 
To  examine  more  than  two  groups  would  have  involved  such 
a  large  number  of  possibilities  as  to  make  the  study 
impractical  in  terms  of  time  and  money.  Thus,  this  study 
excluded  the  more  general  multigroup  comparisons  possible 
with  analysis  of  variance  and  covariance  procedures  and 
was  limited  to  examining  t  tests  and  the  analysis  of 
covariance  using  the  pre  score  as  covariate . 

Additionally  the  case  of  unequal  variances  between 
the  two  groups  v.'as  not  considered. 

A  third  limitation  vjas  that  the  variance  of  the  true 
gains  was  selected  a  priori  to  be  3-6.  True  gain  scores 
with  this  variance  will  be  such  that  over  99  psr  cent  will 
be  within  five  imits  of  the  true  mean  gain.   It  is  the  ratio 
of  the  variances  of  the  gain  and  error  which  is  important. 
Since  the  reliabilities  were  varied,  as  indicated  later, 
this  study  was  conducted  utilizing  several  such  ratios. 

One  of  the  factors  of  interest  was  the  reliability  of 
the  test  used.  A  second  factor  of  interest  was  the  sample 
size  or  the  relative  pov;er  of  tlie  procedures  under  study. 
There  is  an  infinite  number  of  i^eliabilities  and  a  decision 
was  made  as  to  the  levels  of  reliability  to  be  investigated: 
0.50  to  0.90  in  increments  of  0.10.   Tests  vjith  reliabilities 
lower  than  O.5O  are  rarely  used  in  practice  and  at  best  the 
resulting  data  would  be  highly  questionable.  The  lovjer 
limit  of  0.50  was  chosen  for  this  reason. 


Sample  sizes  of  25  and  100  per  group  were  chosen  as 
being  somewhat  representative  of  sample  sizes  used  in 
educational  research. 

Procedures 

Two  groups  v^ere  compared  under  20  conditions  using  t 
tests  as  f ollov;s  .  There  were  five  levels  of  reliability 
(0.50,  0.60,  0.70,  0.80,  0.90)  and  two  levels  of  sample  size 
(25  and  100)  used  in  this  study.   Thus  there  v;ere  ten  differ- 
ent combinations  of  sample  sizes  and  reliabilities.  For  each 
of  these  combinations  two  cases  vjere  investigated,  one  v;here 
there  was  no  pre  to  post  test  gain  in  either  group  and  the 
second  where  there  was  a  knovm.  gain  from  the  pre  to  the  post 
test  for  one  group.   For  each  of  these  20  instances,  100 
samples  were  generated  and  analyzed  using  each  of  the  four 
methods  of  analysis  indicated  previously.  « 

Consequently,  tv;o  questions  vjere  to  be  answered: 

1.  Does  any  one  of  the  selected  methods  yield  a 
disproportionate  number  of  significant  t  values 
v;hen  there  is  no  difference  between  the  mean 
gain  of  the  tv70  groups? 

2.  Is  any  one  of  the  selected  methods  more  pov;er- 
ful,  i.  e.  more  successful  in  detecting  a 
difference  when  a  difference  does  exist? 

Samples  from  a  normal  distribution  v/ere  generated  using 
techniques  described  by  Rosenthal  (I966)  .  The  method  for 
generating  random  numbers  was  the  multiplication  by  a 
constant  method.  With  the  procedures  used,  this  nethod 
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will  produce  8.5  million  numbers  before  the  series  repeats. 
This  number  vjas  more  than  sufficient  for  this  study.  All 
generation  of  the  samples  and  calculation  of  t  values  using 
the  various  methods  of  analysis  were  done  on  the  IBM  36O/65 
computer  at  the  University  of  Florida.  The  significance 
level  used  for  the  t  tests  was  0.05 . 

The  two  research  questions  generated  two  null  hypo- 
theses: 

1.  The  proportion  of  t's  significant  at  the 
0.05  level  is  the  same  for  each  method  of 
analysis  when  there  is  no  gain  in  either 
group . 

2,  The  proportion  of  t's  significant  at  the 
0.05  level  is  the  same  for  each  method  of 
analysis  when  there  is  gain  by  one  group 
but  not  by  the  other. 

These  hypotheses  were  tested  at  each  of  tv7o  combinations 
of  reliability  and  sample  size  with  chi  square  tests  using 
the  0.05  level  of  significance. 

Significance  of  the  Study 

The  results  of  this  study  should  either  indicate 
empirically  that  one  or  more  methods  vrere  superior  to  the 
others  or  that  there  were  no  great  differences  among  the 
methods.   If  the  former  were  true,  then  educational  resear- 
chers may  select  one  of  tlie  better  methods.   If  the  latter 
were  true,  then  educational  researchers  may  select  any  of  the 


methods.   In  either  case  the  study  provides  some  answer  as 
to  how  change  scores  should  be  analyzed. 

Orp:anization  of  the  Study 

Chapter  I  has  been  the  introduction,  statement  of  the 
problem,  limitations,  hypotheses,  and  procedural  overview. 
Chapter  II  reviews  related  literature,  essentially  the 
development  of  the  equation  and  methodology  of  the  various 
techniques  studied.  Chapter  III  describes  the  procedures 
and  Chapter  IV  presents  the  data,  conclusions,  and  summary. 


CHAPTER  II 
RELATED  LITERATURE 


Much  has  been  said  in  the  literature  about  measuring- 
change.  However,  most  of  this  discussion  is  centered 
around  the  four  methods  investigated  in  this  study:   raw 
gain,  Lord's  true  gain,  regressed  gain,  and  analysis  of 
covariance  procedures.  As  pointed  out  in  Chapter  I  raw 
gain  and  regressed  gain  procedures  are  discussed  in  many 
texts  and  therefore  not  discussed  here.  Lord's  true  gain 
and  regressed  gain  procedures  are  discussed  in  this  chapter. 

Derivation  of  Lord's  True  Gain  Scores 

The  following  derivation  parallels  Lord's  (1956) 
development  of  true  gain  scores  with  one  exception  as  noted. 
Lord  gave  the  following  equations  as  a  model  for  the  observed 
pre  and  post  scores: 

(1)  X  =  T  +  E-^ 

(2)  I  =  T  +  G  +  Eg 
where 

X  =  observed  pre  score; 
Y  =  observed  post  score; 
T  =  true  pre  score; 
G  =  true  gain; 
E-j  =  error  of  measurement  in  pre  observed  score; 
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Ep  =  error  of  measurement  in  post  observed  score. 
Lord  then  made  the  following  assumption  concerning  E,  and 
Ep,  the  errors  of  measurement. 

The  Errors 

i)   have  zero  mean  in  the  group  tested. 

2 
ii)   have  the  same  variance  (o  )  for  both  tests. 

iii)   are  uncorrelated  with  each  other  and  with 
true  score  on  either  test  (Lord,  1956)  . 

The  derivation  can  be  considerably  shortened  at  this  point 

by  examining  a  standard  regression  equation  v/hich  predicts 

one  variable,  X^ ,  from  two  other  variables,  X„,  X„.   The 

equation  is  (Tate,  1965,  p.  171) 

<3'  ^1  =  B12.3  C^  (X2  -  X^)  +  B^3  2  i  (X3  -  X3)  +  x^ 


where 


r   —  V       T 

B     =  ^12   ^13  "^23 

12.3       1    p2 

23 


and 


r   -  r   r 

B     =   -12 12  ^23 

13.2       1  _  ^2 

If  we  let 

X-j^  =  G  ^  gain; 

X  =  X  =  observed  pre  score; 

Xo  =  'if  =  observed  post  score; 
the  following  elements  in  the  regression  equation  can  be 
identified  as 

X-,  =  estimated  gain; 
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Xp  =  mean  of  the  observed  pre  scores; 

Xo  =  mean  of  the  observed  post  scores; 

S-,    =  standard  deviation  of  the  gain  scores; 

Sp  =  standa.rd  deviation  of  observed  pre  scores; 

S„  =  standard  deviation  of  observed  post  scores; 
Tp^  =  correlation  of  observed  pre  and  post  scores. 
Lord  has  pointed  out  that  r-,^  and  r,  ^,  the  correlations 
between  observed  score  and  true  score,  are  the  reliabilities 

Prom  (1)  and  his  stated  assumptions,  Lord  v;rites 

,  r'\  ^2    2  ^  2  ^  2 

(5)  ^y  =  ^t  +  °g  +  %   5 

(6)  ^xy^^t'-ne   • 

Lord  solves  these  equations  to  find 

2    2    2  2 

(7)  o      =   a      -^^  o      +   2a         -    2a ^      , 
w  /        g    X    y     xy     e   ' 

the  variance  of  the  true  gain  scores . 

At  this  point  the  only  element  in  the  regression 
equation  vjhich  is  undefined  is  X  .  This  eleraeiit  is  found 
by  considering  the  mean  of  the  observed  pre  scores  vjhich 
from  (1)  can  be  seen  to  be  equal  to  the  mean  of  the  gain 
scores  plus  t?ie  mean  of  the  errors  of  measurement,  that 
is 

(8)  X  =  T  -i-  E   . 

But  by  Lord's  assumption  (i)  E  =  0,  therefore 
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(9)  X  =  T   .   . 

Similarly  from  (2)  Lord  shows 

(10)  Y  =  T  +  G   . 

Then  (9)  is  subtracted  from  (10)  and  rewritten  to  yield 

(11)  G  =  Y  -  X   . 

Thus  all  elements  are  defined  and  (3)  may  be  rewritten  as 
in  terras  of  T,  G,  X,  and  Y  as 

(12)  G  -  B^2  3  ^^   (^  -  ^)  +  B;L3  2  ^   (Y  -  Y)  +  I  -  X 

X  y 

which  Lord  has  asserted  to  be  an  estimate  of  true  gain. 
It  may  be  noted  that  no  notational  scheme  or  other 
method  has  been  presented  to  distinguish  betvjeen  statistics 
and  parameters  in  the  preceding  derivation.  This  lack  is 
in  keeping  v^ith  Lord's  derivation.   It  is  assumed  here 
that  Lord  v.'as  referring  to  parameters  until  the  point  at 
which  he  obtained  the  final  equation  and  that  he  then 
intended  to  use  sample  values  to  estimate  the  appropriate 
parameters  in  the  regression  equation. 

Comparison  of  Lord's  True  Gain  Scores  with  Other  Scores 

In  his  original  article  Lord  (1956)  made  no  comparison 
of  his  method  v;ith  any  other  method.   In  a  subsequent  artj.cle 
(1959)  Lord  again  made  no  mention  of  other  methods.   In 
a  chapter  written  in  Problems  in  Measuring  Chanp:e  (Harris, 
1963,  Chapter  2)  he  made  reference  to  regressed  gain  scores, 
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but  no  discussion  or  comparison  was  presented.   In  Statis- 
tical Theories  of  Mental  Test  Scores  (Lord  and  Novick, 
1968)  no  comparison  of  Lord's  true  gain  scores  with  other 
procedures  is  presented. 

Comparison  of  Regressed  Gain  Scores  with  Other  Scores 

Manning  and  DuBois  (I962)  have  compared  per  cent, 
rav;,  and  residual  gain  scores.  A  per  cent  gain  score  is 
raw  gain  score-  divided  by  the  pre  score  (Manning  and  DuBois, 
1962)  .   The  comparisons  vjere  made  on  the  bases  of  metric 
requirements,  reliability,  and  appropriateness  of  use  in 
correlation  procedures.  On  each  of  these  bases  residual 
gain  scores  were  recommended  over  per  cent  and  raw  gain 
scores.   Manning  and  DuBois  pointed  out  that  per  cent  and 
raw  gain  scores  require  at  least  equal  interval  scales  on 
both  pre  and  post  scores  and  that  the  scales  be  the  same 
on  both  pre  and  post  scores,  i.  e.  the  same  equal  interval 
scale  must  be  used  on  both  tests  .  According  to  Maniiing 
and  DuBois  these  qualities  are  not  possessed  by  educational 
and  psychological  test  scores.   In  contrast,  residual  gain 
scores  do  not  require  the  same  equal  interval  scales  and 
therefore  are  appropriate  for  use  with  test  scores  (Manning 
and  DuBois,  1962)  .  Manning  and  DuBois  summarily  list 
form.ulas  showing  that  residual  gain  scores  are  more  reliable 
and  more  appropriate  measures  for  correlational  procedures 
than  are  raw  or  per  cent  gain  scores.  These  formulas  were 
only  IJsted,  not  derived,  and  no  reference  v;as  made  to  their 
derivation. 
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Another  Study  and  Summary 

Madansky  (1959)  reported  or  derived  several  methods 
for  fitting  straight  lines  to  two  variables  when  both  were 
measured  with  error.  One  of  these  procedures  is  applicable 
in  the  case  when  the  variance  of  the  error  of  measurement 
is  unknown.  Hov;ever,  there  has  apparently  been  no  attempt 
to  apply  the  method  to  the  analysis  of  change. 

A  search  of  the  literature  has  revealed  no  compara- 
tive empirical  examination  of  the  four  methods  examined 
in  this  study.  Further,  the  advocates  and  authors  of  two 
of  the  reported  procedures,  each  of  whom  has  been  shovjn  to 
know  of  the  existence  of  the  other  procedure,  continue  to 
advocate  their  own  method  even  though  they  offer  no  reason 
or  data  for  this  advocacy.  This  study  should  provide  some 
knowledge  as  to  any  difference  in  the  four  methods. 


CHAPTER  III 
METHODS  AND  PROCEDURES 


Procedures:   An  Overview 

As  stated  in  Chapter  I,  Monte  Carlo  techniques  v^Jere 
employed  to  generate  pre  and  post  test  scores  for  tvjo 
groups.   One  group  is  referred  to  as  the  gain  group,  the 
other  as  the  no  gain  group. 

The  model  for  the  observed  pre  scores  is 


(1)  X  =  T  +  E-j^ 


where 

X  is  the  observed  pre  score; 
T  is  the  true  pre  score; 

E-,  is  a  nornally  distributed  random  error 

2 

with  mean  0.0  and  variance  o      . 

®1 

The  model  for  the  post  scores  is 

(2)  Y  =  T  +  G  +  Eg 

where 

y  is  the  observed  post  score; 

T  is  the  true  pre  score; 

G  is  the  true  gain  from  pre  to  post  score; 

Ep  is  a  norraally  distributed  random  error 

2 

vjith  mean  0.0  and  variance   o 
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The  generated  scores  vrere  subsequently  analyzed  for 
the  difference  in  the  amount  of  gain  or  change  between  the 
two  groups.  The  scores  were  analyzed  by  the  four  selected 
methods; 

1.  at  test  on  the  raw  difference  scores 

2.  at  test  on  Lord's  true  gain  scores 

3.  at  test  on  regressed  gain  scores 

4.  a  t  test  from  an  analysis  of  covariance  on 
the  raw  difference  scores  using  pre  score 
as  covariate. 

The  results  of  these  analyses  v/ere  then  compared.  For  tlie 
gain  group  an  appropriate  mean  gain,  y  ,  from  pre  to  post 
scores  was  obviously  selected  to  be  0.0  in  the  case  of  no 
gain  for  either  group  and  selected  to  be  of  such  size  as 
to  make  the  pov;er  0.50  when  there  was  a  gain  in  the  gain 
group.   The  post  scores  were  generated  by  adding  a  random 
normal  gain,  G,  and  a  random  normal  error,  Ep,  to  the 
generated  true  pre  scores .   The  variables  G  and  Ep  had 
means  V  and  0  respectively  and  variances  as  discussed 

D 

later.   For  the  no  gain  group  there  v/as  no  gain  from  pre 
to  post  scores. 

The  pre  scores  for  both  the  gain  and  the  no  gain 
group  were  taken  from  a  normal  population  with  mean  50.0 
and  variance  100.0.   The  mean  and  variance  of  the  popula- 
tion of  post  scores  for  both  the  gain  and  the  no  gain 
groups  were  functions  of  the  mean  true  gain  and  of  the 
re] iability . 
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After  the  samples  were  generated,  the  hypothesis  of  no 
difference  in  average  gain  between  the  two  groups  was  tested 
using  each  of  the  four  methods.  The  t  values  for  each  of 
these  tests  were  recorded. 

This  procedure  was  repeated  over  100  samples  for  each 
of  the  selected  reliabilities  O.5O,  O.6O,  O.7O,  0.80  and 
0.90.  The  method  for  introducing  the  effect  of  the  selected 
reliability  into  each  generated  score  is  presented  in  a 
following  section. 

Thus  100  t  values  were  calculated  and  recorded  for 
each  method  of  analysis  and  at  each  level  of  reliability. 
This  entire  procedure  was  repeated  for  each  of  the  follovn.ng 
conditions: 

1.-   group  size  =  25,   l^  =  0.0  for  both  groups; 

2.  group  size  =  25,   v  5^  0.0  for  the  gain  group 

O 

=  0.0  for  the  no  gain  group; 

3.  group  size  -  100,  V  =0.0  for  both  groups; 

4.  group  size  =  100,  V-     7^  0.0  for  the  gain  group 

O 

=0.0  for  the  no  gain  group. 
V/here  the  gain  vjas  not  equal  to  0.0  it  vjas  such  that  the 
power  of  the  t  tests  on  the  raw  difference  scores  was  O.5O, 
i.  e.  the  expected  proportion  of  rejected  null  hypotheses 
was  0.50.  The  follov;ing  sections  describe  in  more  detail 
some  of  the  previously  mentioned  procedures .* 


*  The  reader  is  also  referred  to  the  FOSTRAH  listing  in 
Appendix  A  for  the  exact  computer  routines  by  which  these 
procedures  v:ere  carried  out. 
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Samplinp;  from  a  Normal  Population  with  Specified  Mean  and 
Variance 

If  P  is  the  cumulative  density  function  of  a  random 

variable  R, ,  thai  the  random  variable  R_  defined  by 

(3)  B2  "  ^^^1^ 

is  imiforiTily  distributed  over  the  interval  [o,l|  (Heyer, 
1965,  P-  256,  Theorem  13-6) .   Here  P  is  the  cumulative 
density  function  of  the  random  variable  R-,  .   It  follows 
then  that  R-,  ,  where 

(^)  Ri  =  F~^(I^2^  ' 

is  normally  distributed  if  F~   is  the  inverse  cumulative 
density  fujiction  of  a  normal  distribution  and  if  R  is  a 
uniform  random  number  on  the  interval  [o,l]  (Meyer,  I965, 
pp.  256-257) . 

Thus  random  samples  from  a  normal  distribution  may  be 
obtained  using  uniform  random  numbers  and  by  (4)  vihere  P 
is  the  cumulative  density  function  of  a  normal  distribution. 
For  a  normal  distribution,  F~  (R  )  must  be  calculated  using 
numerical  approximation  methods.   This  calculation  as  well 
as  the  generation  of  the  uniform  random  numbers  were  done 
using  a  routine  described  by  Rosenthal  (I966,  pp.  270,  287). 
Rosenthal's  techniques  v^ere  adapted  to  the  IBM  36O/65 
computer  installed  at  the  University  of  Florida  (see 
Appendix  A  FUIICTIOII  R/^KD)  .  The  normal  population  sampled 
had  mean  0  and  variance  1.0.   If  a  different  mean  or  variance 
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was  required,  it  was  obtained  by  addition  or  multiplication 
by  an  appropriate  constant. 

Selecting;  Reliability 

Reliabilities  of  O.5O,  O.6O,  O.7O,  0.80  and  O.9O  were 
selected  as  representative  of  reliabilities  found  in  test 
scores.   The  reliability,  rel,  of  a  test  may  be  defined  as 

(5)  ,      rel  =  1  -  — i    (Nunnally,  I967,  p.  221), 

a 

X 

2  -  2 

where  0  is  the  error  of  measurement  variance  and  o      is 
e  X 

2 
the  observed  score  variance.   Since  0  had  been  selected 

X 

a  priori  to  be  100.0,  we  have  from  (5) 

(6)  o^  =  100  (1  -  rel)   . 

®1 

Moreover,  since 

(7)  X  =  T  +  E^ 

and  since  the  error,  E-,  ,  is  assumed  to  be  independent  of 
the  true  score,  T,  we  have 

(8)  a^   z.  o^   -i-   a^         , 

X    t    e^ 

2 

or,  combining  (6)  and  (8)  and  solving  for  o 

(9)  o?  =  100  -  cj^ 

t         e^ 

For  the  post  scores  the  desired  variances  are  also 
eavSily  found  from  the  model  for  a  post  score, 
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(10)  Y  =  T  +  G  +  E^   , 

and  for  which 


(11)         c"^   ^    c\^   o"^  ^  o"^ 
y    t    g    eg 


2 

As  stated  in  Chapter  I,  oj  was  selected  to  be  3.6.   If  (5) 

D 


is  rewritten  with  a      and  a^   instead  of  a^  and  g    ,   respec- 

y      eg  X      e^; 


tively,  then  (5)  and  (11)  may  be  used  to  find 

(12)        /-     =  -L.  _S  _  o2   o^   . 
e^     i^el      t    g   • 

The  effect  of  the  selected  reliability  may  be  obtained 
by  selectiiig  the  error  of  measurenient  variances  and  the 
variance  of  the  true  scores  in  accordance  with  (6),  (9) 
and  (12) . 

Thus  it  is  seen  that  if  true  scores  are  selected  from 
a  distribution  with  variance  o     and  if  the  errors  of 

1/ 

measurement  are  selected  independently  from  a  distribution 
with  variance  o^,  then  by  (8)  X  has  variance  a^,  if  (7)  holds. 

SelectJ-n/^  Gain 

v;hen  there  was  no  gain  in  either  group  the  value  of  v 
would  then  be  0.0.  When  vi   was  nonzero  for  the  gain 
its  value  was  selected  so  as  to  make  the  power  of  the  t 
test  on  the  va.\i   difference  scores  equal  to  O.50.  The 
power  of  0.50  vjas  selected  in  order  to  permit  maximum 
difference  between  the  four  methods  of  analysis. 
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The  value  of  G  was  determined  by  examining  the 
difference  scores  (D) . 

(13)  D  =  Y  -  X, 

and  from 


(1^)  D  =  (T  +  G  +  E^)  -  (T  +  E^)   , 


or 


(15)  D  =  G  +  E^  -  E-[_   . 

The  elements  in  the  right  side  of  (11)  are  mutually  inde- 
pendent normally  distributed  random  variables  whose  vari- 
ances have  been  found  and  thus 

,    ,,  2    2     2    2 

^      '  d    e-j^    g    eg 

Furthermore  since  the  only  difference  in  (15)  for  the  gain 

and  no  gain  groups  is  the  mean  of  G,  the  variance  of  the 

2 
difference  for  the  gain  group,  o  ^   ,  and  the  variance  of 

g 

2 

the  difference  for  the  no  gain  group,  o   ,  are  equal,  i.  e 

ng 

2     2      2 

(17)         °d  =  ''d   =  ^d  • 

g     ng 

This  common  value  a.   may  then  be  used  to  determine  the 

appropriate  value  of  v-     to  produce  the  desired  power  of 

0.50  for  the  t  test  on  the  raw  difference  scores. 

The  t  test  on  the  rav:  difference  scores  is  found  from 

the  follov;in2;  formula: 
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D 


(18) 


t  = 


g  "  ^ng 


(n  -  1)  S^  +  (n  -  1)  S' 


n^  +  n^  -  2 


J}-E 


n 


-  + 


1_ 


If  the  group  size  is  25  and  reliability  O.5O,  the  value  of 
is  found  as  follows: 

2 

(note:      rcl  =  O.5O   implies o     =  106.72) 


(19) 


t  = 


^^-Q 


2^(2a^    )    +   2ii(2af 


ng 


25  +25-2 


1.    4.   1_ 
25  ■"   25 


t  = 


L 

2T9155 


This  value  of  t  is  greater  than  the  critical  value  of  t 

(2.01)  only  if 

(20) 


D. 


2-°^  ^-279^1  ■ 


that  is,  only  if 


(21) 


5.86  <  D 


g 


Thus  if  a  value  of  5.86  is  chosen  for  the  mean  gain,  the 
power  is  .50.  Appropriate  values  for  other  group  sizes 
and  reliabilities  were  similarly  determined. 

Ajgial^_jj3__of_t}vq_t_  Va^^^^ 

The  number  of  t's  significant  at  the  O.O5  level  v/as 
recorded  for  each  of  the  four  methods  of  analysis.  These 
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data  were  recorded  for  each  of  the  20  combiriations  of  sample 
size,  reliability  and  gain.  A  chi  square  statistic  was 
calculated  for  each  of  these  20  sets  to  test  the  null 
hypothesis  of  no  difference  in  the  proportion  of  signifi- 
cant t  values  for  the  four  methods  of  analysis.  These  data 
may  be  seen  in  Tables  1  and  2  of  Chapter  IV. 


chaptp:r  IV 

BESULTS,  CONCLUSIONS,  AND  SUMMARY 


Results 


The  number  of  significant  t's  for  each  method  of 
analysis  under  the  no  gain  condition  is  presented  in 
Table  1,  and  for  the  gain  condition  in  Table  2.  Addi- 
tionally, the  computed  chi  square  statistics  for  each 
reliability  level  are  given.  In  each  case  the  null  hypo- 
thesis tested  v;as  that  the  proportion  of  significant  t's 
was  the  same  for  each  of  the  four  methods  of  analysis.  The 
chi  square  values  v;ere  computed  from  the  2  x  ^  contingency 
tables  implied  by  the  corresponding  line  of  the  table.  For 
example,  for  group  size  of  25  and  a  reliability  of  0.50, 
the  2  X  ^  contingency  table  implied  by  the  first  line  of 
Table  1  is: 

Rav7    Lord's   Regressed   Analysis  of 
gain    gain      gain      covariance 
Significant         5      58        2  2 

Non  significant    95      42        98  98 

As  may  be  seen  by  inspection  of  Tables  1  and  2,  all  the 
chi  square  values  v-;ere  significant  at  the  0.05  level  and 
in  each  case  the  hypothesis  of  equal  proportion  of  signi- 
ficant t  values  for  the  four  methods  of  analysis  v;as  rejected. 
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TABLE  1 


NUMBER  OP  SIGNIFICANT  t's  WHEN  THE  TRUE  MEAN 
GAIN  V;AS  0.0  FOR  BOTH  GROUPS 


GROUP  SIZE  =  25 


RELIABILITY 

RAW 

LORD'S 

REGRESSED 

ANALYSIS 

CHI 

GAIN 

TRUE 

GAIN 

OF 

SQUARE 

GAIN 

COVARIANGE 

0.50 

5 

58 

2 

2 

163.13 

0.60 

5 

53 

4 

4 

128.93 

0.70 

7 

51 

6 

6 

103.69 

0.80 

5 

2^ 

5 

5 

30.76 

0.90 

6 

27 

4 

4 

40.95 

GROUP  SIZE  =  100 


RELIABILITY 

RAW 

LORD'S 

REGRESSED 

ANALYSIS 

CHI 

GAIN 

TRUE 

GAIN 

OF 

SQUARE 

GAIN 

COVARIANGE 

0.50 

6 

83 

5 

5 

242.95 

0.60 

5 

69 

6 

6 

178.28 

0.70 

6 

55 

7 

7 

115.05 

0.80 

5 

50 

5 

5 

111.60 

0.90 

8 

38 

6 

6 

59.61 

CHI  SQUARE  (3, -95)  =  7-82 
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TABLE  2 


NUMBER  OP  SIGNIFICANT  t's  WHEN  THE  POV/ER  OF  THE 
t  TEST  ON  THE  RAV/  DIFFERENCE  SCORES  WAS  O.5O 


GROUP  SIZE  =  25 


RELIABILITY 

RAW 

LORD'S 

REGRESSED 

ANALYSIS 

CHI 

GAIN 

TRUE 

GAIN 

OF 

SQUARE 

GAIN 

COVARIANCE 

0.50 

l^i\- 

89 

5^ 

5^ 

48.80 

0.60 

52 

89 

6it 

64 

33.00 

0.70 

52 

86 

65 

66 

28.23 

0.80 

^7 

78 

50 

51 

25.43 

0.90 

50 

75 

53 

52 

16.90 

GROUP  SIZE  =  100 


RELIABILITY 

RAW 

LORD'S 

REGRESSED 

ANALYSIS 

CHI 

GAIN 

TRUE 

GAIN 

OP 

SCiUARE 

GAIN 

COVARIANCE 

0.50 

58 

95 

66 

66 

38.80 

0.60 

57 

94 

63 

63 

39.48 

0.70 

50 

82 

58 

53 

24.45 

0.80 

52 

88 

54 

54 

38.37 

0.90 

55 

83 

59 

59 

21.35 

CHI  SQUAP-E  (3,-95)  =  7.82 


28 

Further  inspection  of  Tables  1  and  2  reveals  a  higher 
number  of  significant  t's  for  Lord's  true  gain  procedure 
than  for  any  other  methods.  Moreover ,  examination  of  Table 
1  shovis  that  this  particular  technique  gives  a  considerably- 
greater  frequency  of  significant  t  values  than  one  would 
expect  by  chance.  The  expected  frequency  is  5  for  the  a 
priori  established  condition  of  no  actual  difference  in  the 
tv/o  populations  sampled.  These  results  indicate  that  use 
of  Lord's  true  gain  procedure  tends  to  create  a  higher 
significance  level  than  the  user  would  intend.   If  the 
sample  proportion  of  significant  t's  found  in  the  analysis 
is  used  as  an  estimate  of  the  significance  level,  that 
estimate  is  0.58  for  the  case  v:hen  the  group  size  was  25. 
For  the  same  group  size  the  lowest  estimate  of  the  signi- 
ficance level  is  0.39 • 

Conclusions 

Since  the  hypothesis  of  equal  proportion  of  signifi- 
cant t's  for  the  four  methods  of  analysis  was  rejected  in 
each  of  the  10  cases  v:here  the  mean  gain  was  0.0  and  since 
the  use  of  Lord's  true  gain  scores  provided  estimated 
levels  of  significance  which  vrere  considerably  higher  than 
those  intended,  the  use  of  Lord's  true  gain  scores  is 
stronglj''  susi^ect  and  therefore  is  not  recommended. 

No  apparent  differences  were  found  among  the  remaining 
three  methods  of  analysis.  Hov;ever,  there  is  a  similarity 
betvjeen  the  regressed  gain  scores  procedure  and  t}ie 
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analysis  of  co variance  procedure  that  should  be  examined. 
The  data  in  Table  1  indicate  that  the  same  number  of 
significant  t's  v;as  found  by  both  of  these  methods  in  the 
case  where  there  was  no  gain  for  either  group. 

The  100  t  values  for  each  of  the  four  methods  of 
analysis  when  the  group  size  was  25  and  there  was  no  gain 
in  either  group  are  presented  in  Appendix  B.   Inspection 
of  the  t  values  for  the  regressed  gain  procedure  and  the 
analysis  of  covariance  procedure  reveals  a  striking  simi- 
larity between  the  t  values;  for  each  sample  the  t  values 
are  identical  to  at  least  the  first  decimal  place.  As  a 
descriptive  statistic  it  is  noted  that  the  correlation 
between  the  t  values  found  by  these  two  methods  is  1.00 
(rounded  to  3  digits).  Thus,  the  tvjo  methods  are  providing 
very  similar  results. 

In  contrast  the  correlation  between  the  t's  for  the 
rav/  difference  and  regressed  gain  procedures  is  0.898.   The 
two  methods,  regressed  gain  and  analysis  of  covariance, 
are  not  entirely  similar  to  the  raw  difference  procedure. 
It  may  also  be  seen  from  Appendix  B  that  the  signs  of  the 
t's  from  both  the  regressed  gain  and  analysis  of  covariance 
procedures  sometimes  are  opposite  from  the  sign  of  the  t 
for  the  ravj  difference  procedure. 

Snedecor  (195^,  pp.  397,398)  has  indicated  that  the 
regressed  gain  procedure  and  the  analysis  of  covariance 
procedure  on  the  post  scores  using  pre  score  as  covariate 
are  identical  procedures.  No  source  was  found  indicating  a 
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similarity  between  the  regressed  gain  procedure  and  the 
analysis  of  covariance  procedure  on  the  difference  scores 
using  pre  score  as  covariate.  However,  the  two  methods 
may  be  shown  to  be  equivalent  by  writing  the  linear  model 
for  the  regressed  gain,  or,  equivalently ,  for  the  analysis 
of  covariance  on  the  post  scores  using  pre  score  as  covariate, 
and  the  model  for  the  analysis  of  covariance  on  the  differ- 
ence scores  using  pre  score  as  covariate.   The  model  for 
covariance  analysis  on  the  post  scores  is 

(1)  Y  =  Bq  +  BjX  +   BgZ  +  E   (r4endenhall,  I968,  p.  I70)  , 

where  X  and  Y  are  defined  as  previously  and 

Z  =  1,  if  Y  is  from  the  gain  group, 
=0,  if  Y  is  not  from  the  gain  group. 

E  -  a  normally  distributed  random  error  with  mean  0. 
The  model  for  covariance  analysis  on  the  difference  scores 
is 

(2)  D  =  Bp,  +  B-,X  +  BpZ  +  E 


0  "  ^1^  "  ^'Z' 


where 


(3)       D  =  Y  -  X 

and  all  other  elements  are  defined  as  in  (1).   Now  if  the 
right  side  of  (3)  is  substituted  into  (2)  and  the  resultant 
equation  rearranged  to  yield 

(^0       Y  =  Bq  +  (B-L+  1.0)X  -I-  BgZ  +  E   , 
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it  is  seen  that  (1)  and  {k)    are  identical  except  for  the 
addition  of  1.0  to  B,  of  equation  (1)  and  thus  the  two 
methods  will  yield  the  same  t  values  for  testing  the 
hypothesis  that  Bp  is  equal  to  0.0. 

Since  no  clear  difference  was  found  among  the  raw  gain 
procedure,  the  regressed  gain  procedure  and  the  analysis 
of  covariance  procedure,  none  of  these  is  recommended  as 
more  appropriate  for  the  analysis  of  change  than  the  other. 
All  of  these  three  procedures  are  reconimended  above  Lord's 
true  gain  procedure . 

Discussion 

It  is  reasonable  to  ask  if  there  is  some  questionable 
logic  in  Lord's  derivation  of  true  gain  scores.  Tvio 
things  become  apparent  upon  examination  of  the  derivation. 
First  the  formula  v;hich  Lord  uses  to  begin  his  derivation, 
(3)  of  Chapter  II,  requires  that  the  independent  variable 
be  knovai  exactly  (Madansky,  1959;  Scheffe',  1959,  P-  ^■)  , 
i.  e.  v/ithout  error  of  measurement.  The  problem  of  esti- 
mating true  gain  arises  in  that  the  pre  and  post  test  score.8 
are  not  knovn  exactly,  but  instead  the  observed  vcovez),    or 
the  true  scores  plus  measurement  errors,  are  known  as  may 
be  seen  from  Lord's  models  of  the  observed  scoi'es,  (1)  and 
(2)  in  Chapter  II.   If  true  pre  and  true  post  score  were 
known  these  could  be  put  into  the  regression  equation. 
Hov;ever,  if.  true  pre  score  and  true  post  score  were  known 
there  would  be  no  need  for  the  regression  equation  to 
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estimate  gain.  The  gain  could  be  obtained  simply  by  sub- 
tracting true  pre  score  from  true  post  score.  In  short, 
Lord  seems  to  have  assumed  his  conclusion  in  his  derivation. 
Second,  a  look  at  the  basic  method  for  estimating  true 
gain  is  enlightening.   In  order  to  estimate  true  gain  from 
observed  score  it  is  necessary  to  somehow  remove  the  error 
of  measurement  since,  assuming  the  test  to  be  valid,  this 
is  the  factor  that  obscures  the  true  score.  Mendenhall 
(1968,  p.  1)  says  that  "...statistics  is  a  theory  of  infor- 
mation...-." V/hat  information  is  knovra  concerning  the  errors 
of  measurement?  By  Lord's  assumption  (iii)  of  Chapter  II 
the  errors  are  uncorrelated  with  true  score  and  with  each 
other.  Thus  neither  the  observed  pre  score  nor  the  observed 
post  score  should  provide  any  information  concerning  the 
size  of  the  error.   Since  this  error  is  random  it  would 
seem  that  it  could  not  be  removed  from  the  observed  scores. 
Consider  tvjo  equal  observed  scores,  one  obtained  from  a 
higher  true  score  by  the  addition  of  a  negative  error,  the 
other  obtained  from  a  lov/er  true  score  by  the  addition  of 
an  error  of  the  same  size  but  opposite  sign  of  the  previous 
error.   Hov/  does  one  decide  which  score  is  to  have  a  posi- 
tive correction  added  and  which  is  to  have  a  negative 
correction  added?   It  would  appear  that  one  cannot  make 
this  decision  v/ithout  having  some  information  besides  the 
observed  scores . 
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A  Direction  for  Future  Research 

Since  this  study  shows  no  difference  among  the  propor- 
tion of  significant  t's  for  the  raw  gain,  regressed  gain, 
and  the  analysis  of  covariance  procedures,  it  would  be 
interesting  to  investigate  the  use  of  these  procedures 
under  assumptions  other  than  the  models  listed  in  the  first 
section  of  Chapter  III.  Differences  may  occur,  for  example, 
when  the  gain  is  a  linear  function  of  the  pre  score. 

Summary 

This  study  compared  four  selected  measures  of  change. 
The  four  measures  were:   rav;  difference,  Lord's  true  gain, 
regressed  gain,  and  analysis  of  covariance  procedures.  An 
empirical  comparison  was  made  among  these  four  methods. 
Samples  vjere  generated  using  Monte  Carlo  techniques  and 
the  data  in  each  sample  were  analyzed  by  each  of  the  four 
methods . 

It  V7as  found  that  Lord's  true  gain  procedure  produced 
a  number  of  spurious  significant  t  values,  greater  than 
vjould  be  expected  by  chance,  v/hen  there  was  no  real  differ- 
ence in  amount  of  gain  betvjeen  the  tvjo  populations  sampled. 
No  apparent  differences  were  noted  among  the  remaining 
three  methods  and  these  three  methods  did  not  appear  to 
have  inflated  significance  levels.   With  such  data  use  of 
Lord's  true  gain  procedure  is  not  recommended  and  none 
of  the  remaining  three  methods  vjas  recommended  over  the 
others . 


APPENDIX  A 
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FORTRAN  PROGRAM  WHICH  PERFORMED  THE  CALCULATIONS 


C 
C 


c 
c 
c 
c 
c 
c 
c 


DIMENSION  X{100,2,2),LnRD(100,2),DIFF(100,2) 
1,AMAT(3,3),DIFFX{ 3) ,DUM(3) 
DOUBLE  PRECISION  SEED 

REAL  KR21  ,  LO^C  ,  MPRG,  MPRiMG,  MPOG,  MPONG  ,  MLG  ,  MLNG  ,  MRGG  ,  MRGNG 
READ  (5,1)  N,SEEU,NSAMP,KSAMP,,NOPT 

1  FORMAK 13, FlI .0,314) 
IFCNCPT.EC.O)  GO  TO  3 
READ{5,2)  IREL.ISAMP 

IREL  -  RESTART  RELIABILITY  AT  IREL  FOR  ABORTED  RUN 

I  SAMP  -  RESTART  SAMPLE  NUMBER  AT  I  SAMP  FOR  ABORTED  RUN 

2  for;-' AT  (214) 
GG  TO  4 

3  I  R  E  L  -.  1 
ISAMP=I 

't  CONTINUE 

N  -  SIZE  OF  SAMPLE 

GAIN  -  AVERAGE  GAIN  IN  GAIN  GROUP 
SEED  -  SEED  FOR  RANDOM  NUMBER  GENERATOR 
KSAMP  -  NUMBER  OF  SAMPLES  TO  BE  TAKEN  AT  EACH  LEVEL 
NSAMP  -  NUMBER  OF  TFIE  LAST  SAMPLE  FROM  THE  PREVIOUS  RUN 
-  INCREMENTED  AND  PRINTED  OUT  AS  THE  SAMPLE  NUMBER 
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c 
c 


c 
c 
c 
c 
c 
c 
c 
c 
c 
c 
c 


OF  RELIABILITY 

TT=0.0 

DO  1000  IR=IREL,5 
READ(5,14)  GAIN 
14  FORNAK  F6.'^) 

DO  902  J1=ISAMP,KSAMP 
NSAMP=NSAMP+1 

S  -  SUM 

SS  -  SUV:  OF  SQUARES 

D  -  DIFFERENCE  SCORE 

G  -  GAIN  GROUP 

NG  -  NO  GAIN  GROUP 

PR  -  PRE  SCORE 

PO  -  POST  SCORE 

PP  -  PRE  X  POST 

PRDIFF  -  SUM  PR  X  DIFF 

SDG  =0.0 
SONG  =0.0 
SSDG  =0.0 
SSDNG  =0.0 
SPRG  =0.0 
SPRNG  =0.0 
SSPRG  =0.0 
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c 
c 
c 
c 
c 
c 
c 
c 
c 
c 


SSPRNG=0.0 

SPGG   =0.0 

SPCNG  =0.0 

SSPOG  =0.0 

SSPCNG=0.0 

SSPPG  =0.0 

SSPPNG=0.0 

PRDIFF=0.0 

REL=0.40+  0.10»IR 

SEl=    SQRT(  100.0-SEUSEl  ) 

SX  =  SQRTl 100.0-SE1»SE1  ) 

SE2=  SCRT((SEl»SEl+3.36)/REL-3.3  6-SEleSEl) 

GAIW=TT«SGRT{2.0«(SEleSEl+SE2*Sc2+3.36)/N) 

XII, J, K)    I-STUDENT 

J=l,  PRE  SCORE 
=2,  POST  SCORE 

K=l  GAIN 
=2  NO  GAIN 

DIFF(  I  ,  J)    1=  STUDENT 
J=l,  GAIN 
=2,  NO  GAIN 

DC  10  I=1,N 

l)l  =  SX«RAND(SEEC) 

D2=SX«RANC(SEED) 
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X(I,1.1)=50+D1+SE1»RAND(SEED) 

X(I,2,1)=50+01+GAIN+1.83»RAND(SEE0)+SE2»RAND( SEED) 
X( I,l,2)-50+D2+SEl*RAND(SEED) 

X( I,2,2)=50+D2+1.83»RAND(SEED)+SE2*RAND(SEED) 
DIFF( I, 1)-X( I ,2, 1  )-X(  1,1,1) 
DIFF( I,2)=X( I ,2,2)-X{ I, 1,2) 
SDG=SDG    +DIFF(  I,  1) 
SDNG=SDNG    +CI FF{ I ,2) 
SSOG=SSDG+CIFF( I,l)fiCIFF(I,l) 
SSDNG  =  SSCMG  +  DIFF{  l,2)<fDIFF(I,2) 
SPRG=SPRG    +X(  1,1,1) 
SPRNG=SPRNG    +X( I ,  1,2) 
SSPRG=SSPRG    +X(  I,  1,  1  )»X(  I, 1,  1) 
SSPRNG  =  SSPRlNlG  +  X(  I,1,2)«^X(I,1,2) 
SPOG--SPGG  +  X{  I  ,  2,  1  ) 
SPONG=SPONG+X (1,2,2) 
SSPOG  =  SSPCG  +  X (  I, 2,  1  )  *X(  I  , 2, 1  ) 
SSPCNG=SSPOiMG  +  X(  l,Z,2)<-X{  1,2,2) 
SSPPG^SSPPG+    X(  I,  1,  1  )«^X(  1,2,  1) 
SSPPNG=SSPPNG     <-X(  I,  1,2  )»X(  I,  2,2) 
10    PRDIFF=PRCIFF+0IFF(I,1)»X(I,1,1)+DIFF{I,2)<X(I,1,2) 
.  VAPRG=     (SSPRG-SPRG*SPRG/'-J)/(N-l) 
VAPRNG^     (  S£PRNG-SPRMG*SPRi\G/N)  /(N-1  ) 
VAPOG=     (SSPGG-SPdG*SPCJG/N)/(N-l) 
VAPONC^-     (  SSPGN'G-SPONG*SPi);MG/:\  )  /  (  N- 1 ) 
CPPG-     {SSi^PG-SPRG«SPOG/N)/SQRT{  (  SSPRG- SPRGv  SPRG//J )  *  (  S  5P0G- 
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1SP0G»SP0G/N) ) 

CPPNG=  (SSPPNG-SPRNG*SPONG/N )/SQRT( ( SSPRNG-SPRNG* SPRNG/N) * ( 
1SSP0NG-SPCNG*SP0NG/N)  ) 

DBARG=  SDG/N 

DBARNG  =  SDNG/N 

VADG=  (SSDG-SDG»SDG/N)/(N-1) 

VAUNG  =(SSDNG-SDiNG*SDNG/N)/(N-l) 

Tl=  (CBARG-D3ARNG)/SGRT(  {  (  N- 1  )  *^  (  VADG  + VAONG  )  /  {  2*N-2  )  )»{2.0/N 
1))         . 

B1G=  ( ( ( 1.0-REL)»CPPG«SQRT(VAP0G) ) /SORT ( VAPRG ) -RE L+CPPG*CPP 
IG)/ 
1 ( 1.0-CPPG«CPPG) 

B2G=  (RlL-CPPG>CPPG-(  (  1.0-REL  )«  SQRT  (  VAPRG  )  *CP  PG  )  /SQRKVAPOG 
1) )/(l .0-CPPG«CPPG) 

B1NG=  {((  1.0-REL)  »CPP,NG»SQRT(  VAPONG)  )  /  SQRT  (  VAPRNG  ) -REL  +  CP  PNG 
1«CPP,\G)  )/  (1.0-CPPNG  =  CPPMG) 

B2NG=  (REL-CPPNG»CPPNG-( ( 1.0-REL )* SQRT{ VAPRNG ) »CPPNG ) /SQRT{ 
1VAP0\G)  )/  (  1.0-CPPNG«CPP.MG) 

SLG    =0.0 

SSLG   =0.0 

SLNG   =0.0 

SSLNG  =0.0 

KPRG=SPRG/N 

MPtJG  =  SPGC/N 

MPRNG=SPRNG/N 

MPONG  =  SPCf\G/\ 


ko 


DO  110  1=1, N 

LORDt I ,1)=DBARG+B1G*(X(  1,1,1  )-KPRG)+B2G« (X( I ,2,1)-MP0G) 
LORD( 1,2 )=  08ARNG+81NG*(X{ I, 1, 2 )-MPRNG ) +B2NG* (X(I ,2,2)-MPaN 
IG) 
SLG=SLG  +  LCRD(  1,1) 
SLNG  =  SLNG  +  LORD(  1,2) 
SSLG=SSLG+L0RD(1,1)«L0RD{I,1) 
110  SSLNG=SSLNG+LORO( I,2)bL0RD(I,2) 
MLG=SLG/N 
MLNG=SLNG/N 

VALG= (SSLG-SLG*SLG/N  )/CN-1.0) 
VALNG=(SSLNG-SLNG»SLMG/N)/(N-1.0) 
T2=(NLG-MLNG)/SQ^T( ( (SSLG-SLG*SLG/N)+SSLNG-SLNG«SLNG/N)/(N« 

1(N-1 ) ) ) 

A={SSPPG  +  SSPPNG-(  SPRG  +  SPRNG)<-(SP0G  +  SP0NG)/(2'^N)  )  / 
1  (SSPRG  +  SSPRMG-{SPRG  +  SPRNG)fr(  SPRG  +  SPR^JG  )  /  (  2^11)  ) 

B=(SPCG  +  SPOrJG)/(  2*N)-A*(  SPRG  +  SPRNG  )  /  (  2<iH) 

SRGG^C.O 

SSRGG=0.0 

SRGNG=0.0 

SSRGNG=0.0 

UC  210  I=1,N 

RGSG-X(  1,2,  l)-Ai^X(  I,  1,  1)-B 

RGSNG  =  X{  I,2,2)-A«X(  I,  1, 2)-B 

SRGG=SRGG+RGSG 

SSRGG=SSRGG+RGSG«RGSG 
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SRGNG=SRGNG+RGSNG 
210  SSRG!\G  =  SSRGNG  +  RGSNG*RGSNG 

MRGG=SRGG/N 

NRGNG=SRGNG/N 

VARGG=(SSRGG-SRGG»SRGG/N)/(iM-l) 

VARGNG=(SSRGNG-SRGNG»SRGNG/N)/{N-1) 

T3=  (KRGG-MRGNG)/SQKT( ( SSRGG-SRGG»SRGG/N+ SSRGNG-SRGNG* 
1SRGNG/N)/(N»( N-1)  )  ) 

AMATC  1,  1)-2«M 

AMAT( 1,2)=N 

AMAT( I, 3)^SPRG+SPRNG 

AMAT{2,  n^AMAK  1,  2) 

AMAT(2,2)=N 

AMAT(2,3)=SPRG 

Af-'AT(3,  1  )  =  AMAT(  1,  3) 

AMAT( 3,2)=AKAT(2, 3) 

AMAT(3.3)=SSPRG<SSPRNG 
900  CONTINUE 

CALL     INV(AKAT) 

UIFFX(l)-    SDG  +  SDfJG 

DirFX(2)=SCG 

DIFFX(3)=PRCirF 

YXXXXY=0.0 

DO  410  1=1,3 

DUM( I )=0.0 

DO  4  0  5  J--l,3 


^2 


A05    DUMd  )=.CUN{  I  ) +DI  FFX  (  J  )  *  AMAT  (  I  ,  J) 
A 10    YXXXXY  =  YXXXXY  +  CUM( I )«CIFFXl  I ) 

SSE  =  SSCG  +  SSCrvlG-YXXXXY 

VAACCV=SSE/{2.0*N-3) 

T4=DUN(2)/SGRT ( V AACCV* AMAT ( 2 t 2 ) ) 

AA=(PP[:iFF-{SPRG  +  SPRNG)*f(SCG  +  SDNG)/{2*N))/ 
1  {SSPRG  +  SSPRi\G-{SPRG  +  SPRNG)»(  SPRG  + SPR^JG  )  /  (  2»-fNl  )  ) 

AKG=CEARG-AA'^(KPRG-(I^PRG  +  KPRNG)/2) 

AN'NG=CB/lRNG-AA»(MPRNG-(MPRG  +  yPRNG)/2) 

KR21^1.G-(KPRG«llC0-NPRG))/{ 1C0»VAPRG) 

VvRITE  (6,501)     NSAMP,REL,KR21,  T  1 ,  T  2  ,  T3  t  T^  ,  N-PRG  ,  N  PRNG  ,  MPGG  , 
1NPCNG»MG,KLNG,MRGG,MRGNG,AMC,AMNG, SE E D , VAPRG , VAPRNG , VAPCG , 
2VAPCKG,VALG,VALNG,VARGG, VARGNG, VAACOV 

501  FCRFAT(I6,1X,2(F3.2,  1X),1X,4(F7.3),5(^X,2F6.2)/5X,F23.11, 
116X,A(AX,2F6.1),8X,F6.2) 

V<RITE(7,5  02),\SAMP,REL,KR2  1,T1,T2,T3,T^,MPRG,MPRNG,MP0G, 
I K PCNG, y L G, V LN GtNS AMP, KRGG,MRGNG,AKG,AMNG, VAPRG, VAPRNG, 
2VAPCG,VAPCNGf  V ALG t VALNG , V ARGG , VARGNG , VAACOV 

502  FGR^AT(l6,2F3.2,4F7.3,6F6.2,3X,'I'/I6,'iF6.2,8F5.1,F6.2,3X,' 

12«  ) 

902  CCMINUE 
ISAKP^^l 

1000  CCM  INUE 
STCP 

FUNCTICK  RAKD  (RC) 


^3 


DCUDLE  PRECISICN  RC 

RC=CN'CCIRC*  30  517578125. ,3^359738368.) 

X=RC/34359738368. 

Y=SIGK(  l.C,X-0.5) 

V=SCBTI-2.0*ALCG(0.5»(1.0-ABS(1.C-2,0«X)))) 

RAKC=Y«{V-{2.5  15517+C.8  0  2853«V+.C1032  8* 
lVe*2)/[1.0+l.A32788«V  +  0.189269*Vs«-2  +  0.C0130  8tV*'^3)) 

RETURN 

END 

SUDRGLTINE  INVCA) 

PRCGRAM  FCR  FINCIKG  THE  INVERSE  CF  A  3X3  MATRIX 

CINENSICN  A(3,  3),  L(  3  ),f',(  3) 

CATA  N/3/ 

SEARCH  FCR  LARGEST  ELEMENT 

CC80  K=1,N 

L{K)=K 

K  (K)  =  K 

BIGA=A(K,K) 

CC2G  I=K,N 

CC20  J=K,N 

IF(ABS  (DIGA)-AL'S  (A(I,J)))  ]C, 20,20 
10  BIGA=A( I , J) 

L  {  K  )  -  I 

N(K)-J 
20    CCNTINUE 

INTERCH/iNGE  RCWS 


kk 


J=L{K) 

IF(L(K)-K)  35,35,25 
25  DC30  1=1, N 

HCLC=-A(K, I ) 

A(K,  I  )  =  A( J, I) 
30  A(  J,  I  )  =  l-CLr- 

INTERCt-AKGE  CCLUM\S 
35  I  =  f''lK) 

IF(KIK)-K)  A5,^5,3 
37  CC40  J=1,N 

HCLC=-A( J,K ) 

A(  J,K)  =  A( J,  I ) 
40  A( J,  I  )  =  hCLC 

DIVICE  CCLL'FN  EY  MNUS  PIVOT 
A5  CC55  1=1, N 
A6  1F{  I-K)50,55,50 
50  A(  I  ,K  )=A(I,K) /(-A(K,K )  ) 

55  CONTINUE 
RECUCE  r-'ATRIX 
DC65  1=1, N 
CC65  J=1,N 

56  IF(I-K)  57,65,57 

57  IF(J-K)  60,65,60 

60  A(I,J)=A(I,K)^fA(K,J)^A(I,J) 

65  CONTINUE 

CIVICE  RCVv  BY  PIVCT 
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CC75  J=1,N 
68  IF( J-K)70,75, 70 
70  A(K,  J)=yi(K,  J)/A(K,K) 
75  CCKTINIjE 
C      CCMINUEC  FRUCUCT  CF  PIVOTS 
C      REPLACE  PIVCT  BY  RECIPROCAL 
A(K,K)=1.0/A(K,K) 
80  CCMINUE 
C      FINAL  RCW  AND  COLUMN  INTERCHANGE 
K  =  N 
100  K=(K-I) 

1F(K)  150,150,103 
103  I=L{K) 

IF(I-K)  120,120,105 
105  CCUO  J^l.N 
HCLn=A( J,K) 
A ( J,K )=-A ( J,  I  ) 
110  A( J, I )-hCLC 
120  J=y(K) 

IFIJ-K)  100,100,125 
125  CC130  1=1, N 
HCLC=A(K,  I  ) 
A(K,  I  )=-A( J,  I  ) 
130  A( J,  I  )=FCLC 

GC  TC  100 
150  RETURN 


^6 


END 
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LISTING  OF  t's  FOR  EACH  METHOD  OF  ANALYSIS 
WITH  THE  GROUP  SIZE  25  AND  GAIN  0.0 


SAMPLE 

RAW 

LORD'S 

REGRESSED 

ANALYSIS 

NUMBER 

GAIN 

TRUE 
GAIN 

GAIN 

OF 
COVARIANCE 

1 

0.334 

0.935 

0.597 

0.592 

2 

-0.052 

-0.175 

-0.257 

-0.254 

3 

1.428 

3. 180 

1.340 

1.330 

k 

0.888 

3.837 

1.441 

1.453 

5 

O.318 

0.925 

-0.153 

-0.152 

6 

0.005 

0.013 

0.903 

0.906 

7 

2.297 

25.767  . 

0.899 

0.986 

8 

0.288 

1.746 

0.252 

0.249 

9 

-0.904 

-2.464 

-1.383 

-1.370 

10 

-1.008 

-1.691 

-1.013 

-1.003 

11 

-0.186 

-0.664 

-0.243 

-0.241 

12 

-0.051 

-0.040 

-0.255 

-0.253 

13 

-0.232 

-1.325 

-0.161 

-C.159 

14 

-0.535 

-2.182 

-0.959 

-0.958 

15 

2.239 

10.504 

1.702 

1.720 

16 

-0.575 

-2.038 

0.143 

0.152 

17 

-1.297 

-2.736 

-0.717 

-0.725 

18 

-0.533 

-2.794 

-0.269 

-0.267 

19 

0.674 

4.600 

0.638 

0.632 

20 

-0.954 

-5.083 

-1.758 

-1.767 

21 

0.235 

0.937 

0.379 

0.375 

22 

-0.036 

-0.075 

-0.867 

-0.864 

23 

0.451 

2.305 

0,564 

0.558 

24 

-1.324 

-2.824 

-1.661 

-1.648 

25 

-0.856 

-2.220 

-O.518 

-0.516 

26 

1.240 

4.498 

0.735 

0.739 

27 

0.221 

0.910 

-0.112 

-0.112 

28 

0.006 

0.019 

-0.163 

-0.161 

29 

0.165 

0.609 

0.250 

0.247 

30 

0.076 

0.251 

0.164 

0.163 

31 

-0.610 

-2.692 

-0.920 

-0.912 

32 

1.242 

2.416 

1.441 

1.427 

33 

-0.696 

-2.768 

-0.608 

-0.603 

34 

0.410 

2.427 

0.225 

0.224 

35 

-0.076 

-0.379 

-0.254 

-0.252 

36 

-0.670 

-2.169 

-0.894 

-0.884 

37 

2.034 

6.352 

1.791 

1.790 

38 

1.216 

7.274 

0.966 

0.961 

39 

-0.029 

-0.143 

O.8I3 

0.829 

40 

-1.205 

-3-964 

-1.347 

-1.334 

41 

0.361 

0.995 

0.729 

0.722 
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LISTING  OF  t's  (CONTINUED) 


SAMPLE 
NUMBER 


EAV/ 
GAIN 


LORD'S 

TRUE 

GAIN 


REGRESSED 
GAIN 


ANALYSIS 

OP 

COVARIANCE 


42 

43 
44 

45 
46 

47 

48. 
49 
50 

51 
52 

57 

58 

|9 
60 

61 

62 

63 

64 

65 
66 

67 

68 

69 
70 

71 
72 

73 
74 
75 
76 
77 
78 

79 

80 
81 
82 

83 
84 

85 


1.042 
1.556 
0.511 
-1.295 
0.669 
0.382 
0.228 
0.922 

-1.263 

-0.276 

0.496 

1.385 
-0.087 

-1.594 
0.489 

-1.450 
1.146 
1.754 

-0.538 

-0.434 

1.155 
1.321 

0.130 

0.721 

-0.556 

0.173 

-0.1^6 

1.290 

-1.904 

0.572 

0.923 

-0.034 

0.123 

-0.586 

-2.622 

-0.034 

-0.259 

-0.597 

0.065 

0.614 

0.729 

-0.967 

-1.275 

1.259 


1.762 
16.696 

3.033 
-7.504 

4.977 
1.210 
0.216 
5.810 
-3.021 
-0.763 

1.975 
6.511 

-0.187 

-2.591 

3.276 

-3.217 
4.044 
7.198 

-8.090 

-2.971 
3.037 

20.233 

1.233 

2.349 
-1.149 

0.332 
-0.465 

4.543 
-4.395 

5.132 

2.998 
-0.106 

0 .  546 
-2.404 
-8.344 
-0.078 
-0.303 
-2.141 

0.197 
1.834 

0.579 
-2. 813 
-5. 166 

4.430 


0.975 
1.728 
0.184 

-1.783 
1.026 
0.640 

-0.110 

0.943 
-1.114 
-0.941 
0.757 
1.583 
-0.161 
-1.646 
-0.233 
-1.298 

0.657 

1.535 
-0.255 
-0.083 

1.766 

1.620 
-0.008 

0.514 
-0.084 

0.434 
-0.496 

1.526 
-1.858 

0.470 

1.140 
-0.004 

0.026 
-0.920 

-2.917 
-1.368 

-0.257 
0.577 

-0.404 
0.162 
1.078 

-1.069 

-1.077 
0.797 


0.966 
1.710 
0.183 

-I.78O 
1.026 
0.634 

-0.110 

0.933 
-1.106 
-0.933 
0.749 
1.572 
-0.160 
-1.630 
-0.239 
-1.290 

0.657 
1.530 

-0.253 
-0.083 

1.749 

1.604 
-0.003 

0.510 
-0.034 

0.432 
-0.493 

1.514 
-1.844 

0.466 

1.128 
-0.004 

0.026 
-0.911 

-2.895 
-1.332 
-0.254 

0.605 
-0.412 

0.162 

1.073 
-1.058 

-1.071 
0.802 
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LISTING  OF    t's    (CONTINUED) 


SAMPLE 

RAW 

LORD'S 

REGRESSED 

ANALYSIS 

NUMBER 

GAIN 

TRUE 
GAIN 

GAIN 

OF 

GOVARIANCE 

86 

-1.736 

-16.635 

-0.996 

-1.010 

87 

-0.835 

-^•328 

-0.406 

-0.408 

88 

-0.956 

-^.608 

-0.633 

-0,630 

89 

0.930 

3.109 

0.924 

0.915 

90 

0.551 

1.689 

0.579 

0.573 

91 

0.121 

0.420 

0.167 

0.165 

92 

1.352 

9.886 

1.212 

1.204 

U 

2.337 

12.778 

2.149 

2.145 

0.921 

2.005 

0.315 

O.8O9 

^ 

0.599 

3.843 

0.442 

0.438 

96 

0.756 

\M\5   ' 

■   1.030 

1.020 

97 

0.155 

0.659 

-1.004 

-1.019 

98 

-0.091 

-0.168 

-0.162 

-0.160 

99 

-1.^^9 

-57.894 

-0.194 

-0.204 

100 

0.125 

0.638 

-0.218 

-0.216 

BIBLIOGBAPHI 


Anastasl,  Anne  Psycholop;ical  Testing;.  New  York: 
Hacmillan,  19bl. 

Bigge,  Morris  L.  Learning;  Theories  for  Teachers .  New  York: 
Harper  and  Rov?,  196^ 

Combs,  A.  W.,  Snygg,  Donald  Individual  Behavior.  Nev;  York: 
Harper  and  Row,  1959. 

Cronbach,  Lee  J.,  Furby,  Lita  "How  Should  We  Measure 

Change--or  Should  We?"   Psychological  Bulletin.   1970, 
7^:   68-80. 

Hays,  William  L.   Statistics  for  Psychologists.  Nev?  York: 
Holt,  Rinehart,  and  Winston,  I963. 

Hilgard,  Ernest  R.   Theories  of  Learning.  New  York: 
Appleton,  1956. 

Lord,  P.  H.   "The  Measurement  of  Growth,"  Educational  and 
Psychological  Measurement,   1956,  16:   ii-21-'537. 

Lord,  P.M.   "Statistical  Inferences  about  True  Scores," 
Psychopietrika,   1959,  24:   1-17 . 

Lord,  P.  M.   "Elementary  Models  for  Measuring  Change,"   in 
C.  W.  Harris  Problems  in  Measuring  Change .  Madison, 
Wisconsin:   University  of  Wisconsin  Press,  I963, 
pp.  21-38. 

McNemar,  Q.   "On  Grov;th  Measurement,"  Educational  and 
Psychological  Measurement,   1958,  18:  "^7-55. 

Madansky,  Albert  "The  Fitting  of  Straight  Lines  When  Both 
Variables  Are  Subject  to  Error,"   Journal  of  the 
American  Statistical  Association,   I959,  5^:   173-205. 

Manning,  Winton  H.,  DiiBois,  Philip  H.   "Correlational 

Methods  in  Research  on  Human  Learning,"   Perceptual 
and  Motor  Skills,   I962,  15:   287--321 . 

Mendenhall,  William  Introduction  to  Linear  Models  and  the 
Design  and  Analysis  of  Sxi-73riments .  Belmont , 
California:  Wadsv-forth,  r968T' 


51 


52 

Meyer,  Paul  Introductory  Probability  and  Statistical 

Application.  Reading,  Massachusetts:   Addison-V/esley , 

Nunnally,  Jura  C.  Psychometric  Theory.  New  York:  McGraw- 
Hill,  1967. 

Ohnmacht,  Fred  U.   "Correlates  of  Change  in  Academic 

Achievement,"   Journal  of  Educational  Measurement. 
1968,  5:   ^1-^^": 

Rosenthal,  Myron  R.  Numerical  Methods  in  Computer 

Program.ming.  Homevmrd,  Illinois:   Irwin,  I966. 

Scheffe',  Henry  The  Analysis  of  Variance.  New  York:   V/iley, 
1959. 

Schick,  George  B.  (ed) ,  May,  Merril  M.  (ed)   The  Psychology 

of  Reading.  Milwaukee:   The  National  Reading  Conference, 
Inc.,  15th  Yearbook,  I969,  Ranking,  Earl  P.,  Jr.,  Dale, 
Lothar  H.   pp.  17-26. 

Skinner,  B.  F.  The  Technology  of  Teaching.  New  York: 
Appleton,  19b8. 

Snedecor,  George  W.   Statistical  Methods.  Ames,  lovja: 
Iowa  State  College  Press,  1955. 

Soar,  Robert  S.   "Optimum  Teacher-Pupil  Interaction  for 
Sum.mer  Grovith,"   Educational  Leadership  Research 
Supplement,  I968,  26(37:   275^280, 

Tate,  Merle  V/ .   Statistics  in  Education  and  Psychology. 
Nev;  York:   Macmillan,  I965. 

Thorndike,  E.  L.   "The  Influence  of  Chance  Imperfections  of 
Measures  Upon  the  Relation  of  Initial  Score  to  Gain 
or  Loss,"   Journal  of  Experimental  Psychology,   192^-, 
7:   225-232. 

Tillm.an,  Chester  E.  Crude  Gain  VS.  True  Gain:  Correlates 
of  Gain  in  Reading  after  Remedial  Tutor inp.; .  Doctoral 
dissertation.  University  of  Florida^  T^^S. 

Winer,  B.  J.  Statistical  Principles  in  Experimental  Desi.gn. 
New  York:   McGraw-Hill,  19^2"^ 


BIOGRAPHICAL  SKETCH 

John  Howard  Neel  was  born  July  27,  19^^,  at  Waynesburg, 
Pennsylvania.  He  graduated  from  William  R.  Boone  High 
school,  Orlando,  Florida,  in  June,  1962.  In  August,  I965, 
he  received  the  degree  Bachelor  of  Arts  with  a  major  in 
mathematics  from  the  University  of  Florida.   He  taught 
algebra  and  general  mathematics  at  John  F.  Kennedy  Junior 
High  School  from  September,  I965,  until  June,  I966.   In 
September,  I966,  he  enrolled  in  the  College  of  Education 
at  the  University  of  Florida  under  a  United  States  Office 
of  Education  fellowship  program  directed  by  Dr.  V/ilson  H. 
Guertin.   In  September,  I96B,  he  accepted  a  research 
assistantship  under  the  same  program.   In  June,  I96S,  he 
received  the  degree  Master  of  Arts  in  Education.  He  vias 
an  instructor  in  the  College  of  Education  at  the  University 
of  South  Florida  from  September,  19^8,  until  August,  I969, 
and  is  currently  on  leave  from  that  position.   In  September, 
1969,  he  was  appointed  Interim  Instructor  in  the  College  of 
Education  at  the  University  of  Florida  and  he  holds  that 
position  currently. 

John  Howard  Neel  is  married  to  the  former  Carol  Lynn 
Raraft.  They  have  tv;o  daughters,  Sarah  Elizabeth  and  Lia 
Suzanne . 


53 


5^ 

John  Howard  Neel  is  a  member  of  the  Florida  Educational 
Research  Association,  The  American  Educational  Eesearch 
Association,  Phi  Delta  Kappa,  and  the  American  Statistical 
Association, 


This  dissertation  v;as  prepared  uri.der  the  direction  of 
the  chairmen  of  the  candidate's  supervisory  committeG  and 
has  "been  approved  by  all  raombei's  of  that  conirnitteG.   It  V7as 
submitted  to  the  Dean  of  the  ColJ.ege  of  Education  and  to 
the  Graduate  Council,  and  vjas  approved  as  partial  fulfill- 
ment of  the  requirements  for  the  degree  of  Doctor  of 
Philosophy. 
August,  1970 


d,iL.i 


?V6  /X/O^ 


'4^ 


Dean,   Go3/lege,'  of  Education 


Supervisory  Coninittee: 


^/ LA^i  i-y^   7  /  > 
'Chairman/'   '^ 


Dean,   Graduate  School 


Co -Chairman 


ij^ 


•  /^ 


u.. 


7  9  4  6  B 


